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BLAST Basic Local Alignment Search Tool 
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Protein Sequence (154 letters) 

Results for: pci|5777cf KioneO 54aaf]3 

Your BLAST job specified more than one input sequence. This box lets you choose which input sequence 
to show BLAST results for. 

Query ID 

lcl|57770 
Description 
None 
Molecule type 

amino acid 
Query Length 
154 



SEQIDNO: 12 



Database Name 

nr 

Description 

All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental 
samples from WGS projects 
Program 

BLASTP 2.2.20+ Citation 



Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb 
Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs", Nucleic Acids Res. 25:3389-3402. 

Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro 
A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted 
substitution matrices", FEBS J. 272:5101-5109. 
Other reports 

Search Parameters 

Program blastp 

Word size 3 

Expect value 10 

Hitlistsize 100 

Gapcosts 11,1 

Matrix BLOSUM62 

Threshold 1 1 
Composition-based stats 2 

Filter string F 

Genetic Code 1 

Window Size 40 



Posted date Jun 3, 2009 5:41 PM 

Number of letters 3,072,644,098 
Number of sequences 8,977,1 05 
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Entrez query none 



Karlin-Altschul statistics 

Params Ungapped Gapped 

Lambda 0.329052 0.267 

K 0.141985 0.041 

H 0.532528 0.14 



Results Statistics 

Length adjustment 1 1 6 

Effective length of query 38 
Effective length of database 2031299918 
Effective search space 771 89396884 
Effective search space used 771 89396884 

Graphic Summary 

Putative conserved domains have been detected, click on the image below for detailed results. 
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Distribution of 100 Blast Hits on the Query Sequence 



[21 



An overview of the database sequences aligned to the query sequence is shown. The score of each 
alignment is indicated by one of five different colors, which divides the range of scores into five groups. 
Multiple alignments on the same database sequence are connected by a striped line. Mousing over a hit 
sequence causes the definition and score to be shown in the window at the top, clicking on a hit 
sequence takes the user to the associated alignments. New: This graphic is an overview of database 
sequences aligned to the query sequence. Alignments are color-coded by score, within one of five score 
ranges. Multiple alignments on the same database sequence are connected by a dashed line. Mousing 
over an alignment shows the alignment definition and score in the box at the top. Clicking an alignment 
displays the alignment detail. 



Color key for alignment scores 
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Sequences producing significant alignments: 



>gb| AAT904. 



>gb | AA. 



pre-S/S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
large surface antigen [Hepatitis B virus] 
F,2'i?624.„J large/middle/small S proteins [Hepat. 
large S protein [Hepatitis B virus] 
LHBs [HBV genotype A] 

preSl/preS2/S surface protein [Hepatitis B vi . 

large S protein [Hepatitis B virus] 
AO... H& ; 7>3 RecName : Full=Large envelope protein;, 
envelope protein [Hepatitis B virus] 

large surface protein [Hepatitis B virus] 
PreSl-S protein [Hepatitis B virus] 
large surface protein [Hepatitis B virus] 
large surface antigen [Hepatitis B virus] 

large S protein [Hepatitis B virus] 
preS/S protein [Hepatitis B virus] 

surface antigen [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatiti 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

PreSl+PreS2+HBsAg [synthetic construct] 

pre-Sl protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 
preS/S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 

large S protein 

large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 

surface protein S [Hepatitis B virus] 
large S protein 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein 
F0S0841....4 PreSl+PreS2+HBsAg [Hepatitis B virus] 
large surface antigen [Hepatitis B virus] 

pre-S/S protein [Hepatitis B virus] 
F 6 V: large/middle/small ; 

large S protein [Hepatitis B - 

large S protein [Hepatitis B ■ 

large S protein [Hepatitis B ■ 

large S protein [Hepatitis B - 

Bl wild-type molecule; 0RF1 [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 

envelope protein [Hepatitis B virus] 

large/middle/small S proteins [Hepat. 



(Bits) Value 



4e-66 
5e-66 



proteins [Hepat. 



>gb|ABQ43 8. 



5e-66 
5e-66 
5e-66 
5e-66 
5e-66 
5e-66 
5e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 



6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
6e-66 
7e-66 
7e-66 
8e-66 
8e-66 
8e-66 
8e-66 
8e-66 
8e-66 



9e-66 
9e-66 



9e-66 
9e-66 



le-65 
le-65 
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large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
pre-S2/S protein [Hepatitis B virus] 

Large S protein [Hepatitis B virus] 
surface protein [Hepatitis B virus] 

large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
F030842...4 PreSl+PreS2 + HBsAg [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
surface protein [Hepatitis B virus] 

PreSl-S protein [Hepatitis B virus] 
large surface antigen [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
middle S protein [Hepatitis B virus] >gb|AAC58. 
middle S protein [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
middle surface protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 

PreSl-S protein [Hepatitis B virus] 
S protein [Hepatitis B virus] 

middle S protein [Hepatitis B virus] >gb|ACD65. 

Large S protein [Hepatitis B virus] 
large S protein [Hepatitis B virus] 
preSl [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
preSl [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
pre-S2 protein [Hepatitis B virus] >gb|ABI1549. 
S protein [Hepatitis B virus] 
HBsAg [Hepatitis B virus] 
preSl [Hepatitis B virus] 
middle S protein [Hepatitis B virus] 
pre-S2 protein [Hepatitis B virus] 
S protein [Hepatitis B virus] >gb | AB041296 . 1 I . 
HBsAg [Hepatitis B virus] 

S protein [Hepatitis B virus] >gb | AB04126 8 . 1 I . 
pre-S2 protein [Hepatitis B virus] 



le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
le-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
2e-65 
3e-65 
3e-65 
3e-65 
3e-65 
3e-65 
3e-65 



; Select All >. x\\\v >v N . ^ 1 s o :%uKs K/Mupie sMgnmsni 



>gb | AAD37942.1| pre-S/S protein [Hepatitis B virus] 
Length=3 8 7 

Score = 253 bits (647), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 215 274 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 275 T....P S 334 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 335 . E Q 368 
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Score = 253 bits (647), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . E Q 370 

>gb| AAZ23006 . 1 | large S protein [Hepatitis B virus] 
Length=398 

Score = 253 bits (646), Expect = 4e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 215 274 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct -~ r ; T . . . . P S 33 4 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 335 . E Q 368 

>gb|AAC58025.2 | large surface antigen [Hepatitis B virus] 
Length=389 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRF I IFLFILLLCLIFLLVLLDYQG 60 
Sbjct 206 265 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 266 T....P S 325 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 326 .E Q 359 

>gb| AAK97200 . 1 | AF297624_4 large/middle/small S proteins [Hepatitis B virus] 
Length=389 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 60 
Sbjct 206 265 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 266 T P S 325 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 326 .E Q 359 

>gb | AAZ23008 . 1 | large S protein [Hepatitis B virus] 
Length=398 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 215 274 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 275 T....P S 334 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 335 . E Q 368 

>dbj | BAD91280.il LHBs [HBV genotype A] 
Length=400 
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Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . E Q 370 

>emb | CAF2 8 8 7 3 . 2 | preSl/preS2/S surface protein [Hepatitis B virus] 
Length=396 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 213 272 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 273 T....P S 332 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 333 . E Q 366 

>gb| ACH86089 . 1 | large S protein [Hepatitis B virus] 
Length=390 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRF I IFLFILLLCLIFLLVLLDYQG 60 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . E Q 370 

>sp | P03141 . 3 | HBSAG_HBVA3 RecName : Full=Large envelope protein; AltName: Full=Lar 
protein; AltName: Full=L glycoprotein; AltName: Full=Major 
surface antigen; AltName: Full=Large S protein; AltName: 
Full=L-HBsAg; Short=LHB 

emb | CAA26539 . 1 | unnamed protein product [Hepatitis B virus] 
Length=400 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 60 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 27- T .... P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . E Q 370 

>db j | BAF48757 . 1 | envelope protein [Hepatitis B virus] 
Length=400 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . E Q 370 
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>gb| ABH04415 . 1 | large surface protein [Hepatitis B virus] 
Length=398 

Score = 253 bits (645), Expect = 5e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 215 274 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 275 T....P S 334 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 335 .E Q 368 

>dbj | BAG68757 . 1 | PreSl-S protein [Hepatitis B virus] 
Length=400 

Score = 253 bits (645), Expect = 6e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . 33 Q 370 

>db j | BAD02316 . 1 | large surface protein [Hepatitis B virus] 
Length=400 

Score = 253 bits (645), Expect = 6e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRF I IFLFILLLCLIFLLVLLDYQG 60 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 .33 Q 370 

>emb | CAC51294 . 1 | large surface antigen [Hepatitis B virus] 
Length=400 

Score = 253 bits (645), Expect = 6e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 60 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 27- T .... P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . 33 Q 370 

>gb| ACF95017 . 1 | large S protein [Hepatitis B virus] 
Length=400 

Score = 253 bits (645), Expect = 6e-66, Method: Compositional matrix adjust. 
Identities = 149/154 (96%), Positives = 150/154 (97%), Gaps = 0/154 (0%) 

Query 1 GGSPVCLGQNSQSPTSNHSPTSCPPICPGYRWMCLRRFI IFLFILLLCLIFLLVLLDYQG 6 0 
Sbjct 217 276 

Query 61 MLPVCPLIPGSTRTSTGQCKTCTTPAQGNSMFPSCCCTKPTDGNCTCIPIPLSWAFAKYL 120 
Sbjct 277 T....P S 336 

Query 121 WVWASVRFSWLSLLVPFVRWFVGLSPTVWLSAIW 154 
Sbjct 337 . 33 Q 370 
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